Abstract

Background: Diagnostic pathology depends on complex, structured reasoning to interpret clinical, histologic, and molecular data. Replicating this cognitive process algorithmically remains a significant challenge. As large language models (LLMs) gain traction in medicine, it is critical to determine whether they can provide clinically useful reasoning in highly specialized domains such as pathology.

Methods: We evaluated the performance of four reasoning LLMs (OpenAI o1, OpenAI o3-mini, Gemini 2.0 Flash Thinking Experimental, and DeepSeek-R1 671B) on 15 board-style, open-ended pathology questions. Responses were independently reviewed by 11 pathologists using a structured framework that assessed language quality (accuracy, relevance, coherence, depth, and conciseness) and seven diagnostic reasoning strategies. Scores were normalized and aggregated for analysis, and inter-observer agreement was evaluated to assess scoring consistency. Model comparisons were conducted using one-way ANOVA and Tukey's Honestly Significant Difference (HSD) test.

Results: Gemini and DeepSeek significantly outperformed OpenAI o1 and OpenAI o3-mini in overall reasoning quality (p < 0.05), particularly in analytical depth and coherence. While all models achieved comparable accuracy, only Gemini and DeepSeek consistently applied expert-like reasoning strategies, including algorithmic, inductive, and Bayesian approaches. Performance varied by reasoning type: models performed best in algorithmic and deductive reasoning and poorest in heuristic reasoning and pattern recognition. Inter-observer agreement was highest for Gemini (p < 0.05), indicating greater consistency and interpretability. Models with more in-depth reasoning (Gemini and DeepSeek) were generally less concise.

Conclusion: Advanced LLMs such as Gemini and DeepSeek can approximate aspects of expert-level diagnostic reasoning in pathology, particularly in algorithmic and structured approaches. However, limitations persist in contextual reasoning, heuristic decision-making, and consistency across questions. Addressing these gaps, along with the trade-off between depth and conciseness, will be essential for the safe and effective integration of AI tools into clinical pathology workflows.
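The Methods above describe normalizing reviewer scores and comparing models with a one-way ANOVA followed by Tukey's HSD. The sketch below is not the authors' code; it only illustrates that analysis pattern, assuming Python with scipy and statsmodels and using made-up placeholder scores in place of the study's data.

```python
# Minimal sketch of the described analysis: one-way ANOVA across models,
# then Tukey's HSD for pairwise comparisons. All score values are
# hypothetical placeholders, not the study's data.
import numpy as np
from scipy.stats import f_oneway
from statsmodels.stats.multicomp import pairwise_tukeyhsd

rng = np.random.default_rng(0)

# Hypothetical normalized (0-1) reasoning-quality scores: one value per
# reviewer-question rating, grouped by model.
scores = {
    "OpenAI o1":      rng.uniform(0.55, 0.80, 30),
    "OpenAI o3-mini": rng.uniform(0.55, 0.80, 30),
    "Gemini 2.0 FTE": rng.uniform(0.65, 0.90, 30),
    "DeepSeek-R1":    rng.uniform(0.65, 0.90, 30),
}

# One-way ANOVA: do mean scores differ across the four models?
f_stat, p_value = f_oneway(*scores.values())
print(f"ANOVA: F = {f_stat:.2f}, p = {p_value:.4f}")

# Tukey's HSD post-hoc test: which specific model pairs differ?
values = np.concatenate(list(scores.values()))
groups = np.repeat(list(scores.keys()), [len(v) for v in scores.values()])
print(pairwise_tukeyhsd(values, groups, alpha=0.05))
```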
-
Two-dimensional (2D) materials offer immense potential for scientific breakthroughs and technological innovations. While early demonstrations of 2D material-based electronics, optoelectronics, flextronics, straintronics, twistronics, and biomimetic devices exploited micromechanically-exfoliated single crystal flakes, recent years have witnessed steady progress in large-area growth techniques such as physical vapor deposition (PVD), chemical vapor deposition (CVD), and metal–organic CVD (MOCVD). However, the use of high growth temperatures, chemically-active growth precursors and promoters, and the need for epitaxy often limit direct growth of 2D materials on the substrates of interest for commercial applications. This has led to the development of a large number of methods for the layer transfer of 2D materials from the growth substrate to the target application substrate, with varying degrees of cleanliness, uniformity, and transfer-related damage. This review aims to catalog and discuss these layer transfer methods. In particular, the processes, advantages, and drawbacks of various transfer methods are discussed, as is their applicability to different technological platforms of interest for 2D material implementation.
